Reputation Extraction Using Both Structural and Content Information

نویسندگان

  • H. Hasegawa
  • M. Kudo
  • A. Nakamura
  • Hiroyuki Hasegawa
  • Mineichi Kudo
  • Atsuyoshi Nakamura
چکیده

We propose a new method of extracting texts related to a given keyword from Web pages collected by a search engine. By combining structural pattern matching and text classification, texts related to a given keyword such as reputations of a given restaurant can be extracted automatically from Web pages in unfixed sites, which is impossible by conventional wrappers. According to our cross validation results on extracting reputations of a given Ramen shop from Web pages collected by a search engine, our method achieved 79.3% precision and 56.6% recall by allowing acceptable errors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analyzing the Structural and Outward Features and Expected Activities of a Service-Extension Agricultural Website in Iran

Background and Aim: This study aimed to identify and Analyzing the structural and outward features and expected activities of a service-extension agricultural website, based on the views of experts in the field of agriculture and related sciences and webmasters and blogs in Iran. Method: The methodological approach was a descriptive and survey study. The statistical population of the study cons...

متن کامل

Identifying Credibility Criteria in Scholarly Communication (Reading and Citing) form the Standpoints of Faculty Members of Kharazmi University

Background and Aim: In effect, every scientific endeavor consisted of scientific communication and scientists’ involvement in particular field of study; and scientific board members as the most outstanding elements play a key role in scientific productions. Therefore, a constructive scientific communication requires obtaining credible and valid information. In so doing, this study tries to inve...

متن کامل

Structural Model of Brand Ambidexterity Impact on Brand Commitment through Brand’s Performance, Image and Reputation

Brand ambidexterity strategies help organizations improve their capabilities and performance and simultaneously discover new opportunities. The purpose of this study is to investigate the effects of brand ambidexterity strategies on brand commitment through brand’s performance, image and reputation. The statistical population of this research were the users of Pishgaman Company. Random sampling...

متن کامل

Content Contribution for Revenue Sharing and Reputation in Social Media: A Dynamic Structural Model

This study examines the incentives for content contribution in social media. we propose that exposure and reputation are the major incentives for contributors. Besides, as more and more social media web sites offer advertising-revenue sharing with some of their contributors, shared revenue provides an extra incentive for contributors who have joined revenue-sharing programs. we develop a dynami...

متن کامل

Data Extraction using Content-Based Handles

In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired by the way a human user scans the page content for specific data. In particular, we use text fea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005